Proteomics Report: DIA Benchmark Experiment

DE Analysis

Author

Andrea

Experiment

Yeast as background and USP2 proteins spiked in different concentration. This Experiment is designed for DIA benchmarking of different workflow using DIA-NN.

Import data

Reading DIA data from data/report.tsv using Precursor.Translated intensities from DIA-NN.

All the precursors with Q-value < 0.01 are used for the analysis.

Filtering steps

The pre-processing steps are:

  • Filtering precursor with more than 3 samples with no missing intensities across all the samples
  • Select only proteotypic peptides
  • Select proteins with at least 3 peptides
  • Contaminats proteins are not filtered

Data Missing Analysis

The plot shows the completeness of the experiments at precursor and summarized level. On the the x-axis peptides/proteins are ordered based on the increasing NA content.

Normalization

Raw intensities from DIA-NN are processed as follow:

  • Log 2 transformation
  • Normalized across all the samples using quantiles method
  • Summarization at protein level using medianPolish function

PCA

DE Analysis

Using MSqRob2 with the following formula ~ -1 + Group with ridge regression disabled

  GroupA   GroupB   GroupC   GroupD   GroupE 
14.98936 14.77959 14.88881 15.01346 14.95730 

QC plots

Group Comparison

This volcano plot summarizes the differential expression landscape in the comparison between the two groups

These bar plots summarizes the number of significantly upregulated/downregulated number of proteins based on different adjusted p-values (selected adjusted p-values are 0.001, 0.01, 0.05, and 0.1 - see facet headers) and log2 fold-change thresholds (on the x-axis) used to define the significance levels.

This volcano plot summarizes the differential expression landscape in the comparison between the two groups

These bar plots summarizes the number of significantly upregulated/downregulated number of proteins based on different adjusted p-values (selected adjusted p-values are 0.001, 0.01, 0.05, and 0.1 - see facet headers) and log2 fold-change thresholds (on the x-axis) used to define the significance levels.

Summary of the DE proteins

In the table you can find a summaries of the number of DE proteins found in all the comparisons. The overlapping of DE proteins among the comparisons is also showed using an upset plot.

Session Info

R version 4.3.3 (2024-02-29 ucrt)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 10 x64 (build 19045)

Matrix products: default


locale:
[1] LC_COLLATE=English_Belgium.utf8  LC_CTYPE=English_Belgium.utf8   
[3] LC_MONETARY=English_Belgium.utf8 LC_NUMERIC=C                    
[5] LC_TIME=English_Belgium.utf8    

time zone: Europe/Brussels
tzcode source: internal

attached base packages:
[1] stats4    stats     graphics  grDevices utils     datasets  methods  
[8] base     

other attached packages:
 [1] reshape2_1.4.4              heatmaply_1.5.0            
 [3] viridis_0.6.5               viridisLite_0.4.2          
 [5] pheatmap_1.0.12             UpSetR_1.4.0               
 [7] gridExtra_2.3               hrbrthemes_0.8.7           
 [9] factoextra_1.0.7            MSnbase_2.28.1             
[11] ProtGenerics_1.34.0         mzR_2.36.0                 
[13] Rcpp_1.0.12                 DT_0.33                    
[15] plotly_4.10.4               here_1.0.1                 
[17] magrittr_2.0.3              lubridate_1.9.3            
[19] forcats_1.0.0               stringr_1.5.1              
[21] purrr_1.0.2                 readr_2.1.5                
[23] tidyr_1.3.1                 tibble_3.2.1               
[25] tidyverse_2.0.0             dplyr_1.1.4                
[27] ggrepel_0.9.5               ggplot2_3.5.1              
[29] visdat_0.6.0                readxl_1.4.3               
[31] msqrob2_1.10.0              QFeatures_1.12.0           
[33] MultiAssayExperiment_1.28.0 SummarizedExperiment_1.32.0
[35] Biobase_2.62.0              GenomicRanges_1.54.1       
[37] GenomeInfoDb_1.38.8         IRanges_2.36.0             
[39] S4Vectors_0.40.2            BiocGenerics_0.48.1        
[41] MatrixGenerics_1.14.0       matrixStats_1.3.0          

loaded via a namespace (and not attached):
  [1] splines_4.3.3           later_1.3.2             bitops_1.0-7           
  [4] cellranger_1.1.0        preprocessCore_1.64.0   XML_3.99-0.16.1        
  [7] lifecycle_1.0.4         rstatix_0.7.2           doParallel_1.0.17      
 [10] rprojroot_2.0.4         processx_3.8.4          lattice_0.22-6         
 [13] MASS_7.3-60.0.1         crosstalk_1.2.1         dendextend_1.17.1      
 [16] backports_1.4.1         limma_3.58.1            sass_0.4.9             
 [19] rmarkdown_2.26          jquerylib_0.1.4         yaml_2.3.8             
 [22] httpuv_1.6.15           MsCoreUtils_1.14.1      minqa_1.2.6            
 [25] RColorBrewer_1.1-3      abind_1.4-5             zlibbioc_1.48.2        
 [28] AnnotationFilter_1.26.0 RCurl_1.98-1.14         seriation_1.5.5        
 [31] gdtools_0.3.7           GenomeInfoDbData_1.2.11 crul_1.4.2             
 [34] ncdf4_1.22              codetools_0.2-20        DelayedArray_0.28.0    
 [37] tidyselect_1.2.1        httpcode_0.3.0          farver_2.1.2           
 [40] lme4_1.1-35.3           TSP_1.2-4               webshot_0.5.5          
 [43] jsonlite_1.8.8          iterators_1.0.14        systemfonts_1.1.0      
 [46] foreach_1.5.2           tools_4.3.3             glue_1.7.0             
 [49] Rttf2pt1_1.3.12         SparseArray_1.2.4       BiocBaseUtils_1.4.0    
 [52] xfun_0.43               ca_0.71.1               withr_3.0.0            
 [55] BiocManager_1.30.23     fastmap_1.1.1           boot_1.3-30            
 [58] fansi_1.0.6             callr_3.7.6             digest_0.6.35          
 [61] timechange_0.3.0        R6_2.5.1                mime_0.12              
 [64] colorspace_2.1-0        utf8_1.2.4              generics_0.1.3         
 [67] fontLiberation_0.1.0    data.table_1.15.4       httr_1.4.7             
 [70] htmlwidgets_1.6.4       S4Arrays_1.2.1          pkgconfig_2.0.3        
 [73] gtable_0.3.5            registry_0.5-1          impute_1.76.0          
 [76] XVector_0.42.0          htmltools_0.5.8.1       fontBitstreamVera_0.1.1
 [79] carData_3.0-5           MALDIquant_1.22.2       clue_0.3-65            
 [82] scales_1.3.0            knitr_1.46              tzdb_0.4.0             
 [85] nlme_3.1-164            curl_5.2.1              nloptr_2.0.3           
 [88] cachem_1.0.8            parallel_4.3.3          extrafont_0.19         
 [91] mzID_1.40.0             vsn_3.70.0              pillar_1.9.0           
 [94] grid_4.3.3              vctrs_0.6.5             pcaMethods_1.94.0      
 [97] promises_1.3.0          ggpubr_0.6.0            car_3.1-2              
[100] xtable_1.8-4            cluster_2.1.6           extrafontdb_1.0        
[103] evaluate_0.23           cli_3.6.2               compiler_4.3.3         
[106] rlang_1.1.3             crayon_1.5.2            ggsignif_0.6.4         
[109] labeling_0.4.3          ps_1.7.6                affy_1.80.0            
[112] plyr_1.8.9              stringi_1.8.3           BiocParallel_1.36.0    
[115] assertthat_0.2.1        munsell_0.5.1           lazyeval_0.2.2         
[118] fontquiver_0.2.1        Matrix_1.6-5            hms_1.1.3              
[121] gfonts_0.2.0            statmod_1.5.0           shiny_1.8.1.1          
[124] igraph_2.0.3            broom_1.0.5             affyio_1.72.0          
[127] bslib_0.7.0